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A potential cytosolic metallocarboxypeptidase from Burk- 
holderia cenocepacia has been crystallized and a synchrotron- 
radiation microfocus beamline allowed the acquisition of 
diffraction data to 1.9 A resolution. The asymmetric unit 
comprises a tetramer containing over 1500 amino acids, and 
the high-throughput automated protocols embedded in 
PDB_REDO were coupled with model-map inspections in 
refinement. This approach has highlighted the value of such 
protocols for efficient analyses. The subunit is constructed 
from two domains. The N-terminal domain has previously only 
been observed in cytosolic carboxypeptidase (CCP) proteins. 
The C-terminal domain, which carries the Zn 2+ -containing 
active site, serves to classify this protein as a member of the 
M14D subfamily of carboxypeptidases. Although eukaryotic 
CCPs possess deglutamylase activity and are implicated in 
processing modified tubulin, the function and substrates of the 
bacterial family members remain unknown. The B. cenoce- 
pacia protein did not display deglutamylase activity towards 
a furylacryloyl glutamate derivative, a potential substrate. 
Residues previously shown to coordinate the divalent cation 
and that contribute to peptide-bond cleavage in related 
enzymes such as bovine carboxypeptidase are conserved. The 
location of a conserved basic patch in the active site adjacent 
to the catalytic Zn 2+ , where an acetate ion is identified, 
suggests recognition of the carboxy-terminus in a similar 
fashion to other carboxypeptidases. However, there are 
significant differences that indicate the recognition of 
substrates with different properties. Of note is the presence 
of a lysine in the SI' recognition subsite that suggests 
specificity towards an acidic substrate. 
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1. Introduction 

Carboxypeptidases (CPs) hydrolyse peptide bonds in target 
proteins to release C-terminal amino acids. These enzymes 
have been classified into different families according to 
sequence similarity, mechanism and function (Gomis-Ruth, 
2008; Rawlings et al, 2012). One of the largest groups, carrying 
a single catalytic Zn 2+ in the active site, is the Ml 4 family. This 
is divided into four subfamilies known as M14A, M14B, 
M14C and M14D. One of the earliest and best studied family 
members is bovine carboxypeptidase A (CP-A; Christianson 
& Lipscomb, 1989). In general, M14A and M14B carboxy- 
peptidases either function within the secretory pathway or are 
themselves secreted. The members of the M14A subfamily are 
produced as pro-enzymes that are activated following the 
cleavage of N-terminal segments. The M14B subfamily are 
produced as active enzymes with a transthyretin-like domain 
at the C-terminus. The M14C group consists of bacterial 
enzymes related to y-D-glutamyl-(L)-meso-diaminopimelate 
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peptidase I (http://merops.sanger.ac.uk/) and process compo- 
nents of the bacterial cell wall. Like their M14A relatives, they 
also carry an N-terminal extension. 

The M14D subfamily has only recently been described and 
consequently there is only limited information regarding their 
structure and activity. They are referred to as cytosolic 
carboxypeptidases (CCPs; http://merops.sanger.ac.uk/) to 
reflect their cellular location. A search of available genomic 
data identified CCPs in many organisms (Otero et al, 2012) 
and the number of CCP paralogues varies between species 
(Rodriguez de la Vega et al, 2007). Four mammalian CCP 
proteins and a homologue from Caenorhabditis elegans called 
CeCCPP-6 have been identified as deglutamylases (Kimura et 
al, 2010; Rogowski et al, 2010). In addition to this activity, 
CCPs might process other C-terminal amino acids. For 
example, one member of the family has recently been reported 
to cleave the C-terminal tyrosine from a-tubulin (Sahab et al, 
2010). Less is known about the functional activity of the 
bacterial CCPs. A recent study of Pseudomonas aeruginosa 
CCP {PaCCP) failed to detect any activity towards known CP 
substrates, and the possibility has been raised that structural 
changes might be required to generate an active enzyme 
(Otero et al, 2012). 

Our interest in the bacterial homologues is twofold. It has 
proven difficult to obtain a recombinant source of mammalian 
CCP for characterization and therefore the prokaryotic 
proteins offer a surrogate system to inform on the post- 
translational modification of tubulin. We also note that in the 
important Gram-negative pathogen P. aeruginosa the gene 
encoding the CCP homologue (PA2831) has been identified as 
essential for the establishment of a lung infection in a murine 
model (Winstanley et al, 2009), and as such the CCP might 
represent a potential drug target for an important group of 
Gram-negative bacteria. As part of a wide-ranging project 
to advance early-stage drug discovery of novel antibacterials, 
we have adopted a structure-based approach to drive target 
assessment (Eadsforth et al, 2012; Moynie et al, 2013). Key to 
this process is to acquire material for ligand-binding studies: 
an accurate structural model which can be exploited to inform 
on aspects of druggability and crystals that might assist in 
fragment screening (Hunter, 2009). We initiated work on 
P. aeruginosa CCP (PaCCP; UniProt entry Q9I012) and 
on Burkholderia cenocepacia CCP (BcCCP; UniProt entry 
B4EEQ5). We made no progress with PaCCP, but a study has 
been published by others (Otero et al, 2012). Our work with 
BcCCP was more successful and here we report the prepara- 
tion of an efficient recombinant protein-production system 
and protocols for purification and crystallization together with 
the high-resolution structure. With only small crystals avail- 
able, it was necessary to exploit a microfocus beamline to 
obtain the diffraction data. BcCCP consists of 384 amino acids 
and forms a tetramer. The asymmetric unit comprises four 
subunits comprising in excess of 1500 amino acids. With a 
sizable model and high-resolution diffraction data with which 
to refine it, we sought an efficient route through the refine- 
ment process and incorporated the strategies employed in 
the PDB_REDO protocols (Joosten et al, 2011, 2012). We 



compare BcCCP with PaCCP (Otero et al, 2012), with two 
other bacterial CCP structures in the Protein Data Bank 
(PDB) and with mammalian carboxypeptidases of defined 
substrate specificity. These structures identify the key residues 
and motifs that are important for the stability of the CCP fold 
and the architecture of the catalytic centre, reveal the 
presence of a flexible loop at the periphery of the active site 
and suggest that the substrate of the bacterial proteins is 
acidic. Moreover, they support a druggability analysis of the 
active site. 

2. Materials and methods 

2.1. Reagents, preparation of a recombinant expression 
system and protein purification 

Chemicals and materials were sourced from Sigma- Aldrich 
and VWR International, except where stated otherwise. 
Genomic DNA from B. cenocepacia J2315 [Laboratory for 
Microbiology, Faculty of Sciences, Ghent University (LMG) 
strain 16656] was the template for PCR with the following 
primers designed to amplify the BcCCP open reading frame 
using Ndel and Xhol restriction sites (bold), respectively: 
5 -CAT-ATG-ACC-CTT-TCG-ATC-ACC-AG-3' and 5 CTC 
G AG-TTA- CG C- G A A- CGT- GTC-3 ' . The PCR product was 
ligated into pCR-Blunt II-TOPO (Invitrogen) prior to being 
subcloned into a modified pET-15b vector (Novagen), which 
produces a histidine-tagged protein with a Tobacco etch virus 
(TEV) protease site. The hexa-His tag, TEV cleavage site and 
additional residues from the choice of cloning sites added a 
total of 24 extra amino acids at the N-terminus. The recom- 
binant plasmid was amplified in Escherichia coli XL-1 Blue 
cells and the integrity of the gene sequence was verified by 
DNA Sequencing & Services (Dundee University) before 
being transformed into E. coli ArcticExpress (DE3) compe- 
tent cells for protein production. Cells were grown in 1 1 
Luria-Bertani medium supplemented with 50 |ig ml -1 carbe- 
nicillin. Gene expression was induced at 310 K using 1 mM 
isopropyl /3-D-l-thiogalactopyranoside and growth continued 
for 72 h at 286 K. 

The cells were harvested by centrifugation (30 min at 3500g 
and 277 K), resuspended in lysis buffer (50 mM Tris-HCl pH 
7.5, 250 mM NaCl) containing DNase I (0.1 mg) and a single 
tablet of a cocktail of EDTA-free protease inhibitors (Roche) 
and lysed using a French press at 110 MPa. Insoluble debris 
was separated by centrifugation at 37 500g for 30 min at 277 K. 
The soluble fraction was loaded onto a 5 ml HisTrap HP 
column (GE Healthcare) and a linear imidazole concentration 
gradient was applied to elute BcCCP. Fractions were analyzed 
using SDS-PAGE and those containing BcCCP were pooled. 
The hexa-His tag could not be cleaved, so protein still carrying 
the affinity tag was further purified using a Superdex 200 26/60 
size-exclusion column (GE Healthcare) equilibrated with 
50 mM Tris-HCl, 250 mM NaCl pH 7.5. This column had 
previously been calibrated using the molecular-mass standards 
thyroglobulin (670 kDa), y-globulin (158 kDa), ovalbumin 
(44 kDa), myoglobin (17 kDa) and vitamin B 12 (1.35 kDa) 
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(Bio-Rad; Fig. 1). Selected fractions were pooled, dialyzed 
into 100 mM sodium acetate, 150 mM NaCl pH 5 and 
concentrated to 10 mg ml -1 using Amicon Ultra devices 
(Millipore). The protein concentration was determined spec- 
trophotometrically using a theoretical extinction coefficient 
of 73 590 AT 1 cm" 1 at 280 nm calculated using ProtParam 
(Gasteiger et al, 2005). The sample purity was verified by 
matrix-assisted laser desorption/ionization time-of-flight mass 
spectrometry and SDS-PAGE (data not shown), with typical 
protein yields in excess of 20 mg per litre of culture. Native 
PAGE and size-exclusion chromatography with multi-angle 
light scattering (SEC-MALS) were used to establish the 
oligomeric state of the protein. SEC-MALS experiments were 
performed on a Dionex Ultimate 3000 HPLC system with an 
inline Wyatt miniDAWN TREOS multi-angle light-scattering 
detector and an Optilab T-rEX refractive-index detector. 
SEC-MALS analysis was performed on an Superdex 200 10/ 
300 column (GE Healthcare) equilibrated with 50 mM Tris- 
HC1, 250 mM NaCl pH 7.5. Molar masses spanning the elution 
peaks were calculated with ASTRA v.6.0.0.108 (Wyatt). 



2.2. Crystallization and data collection 

Crystallization trials were carried out with a Phoenix 
liquid-handling system (Art Robbins Instruments/Rigaku) 
and commercially available screens (Hampton Research). 
100 nl protein solution (10 mg ml -1 in 100 mM sodium acetate 
pH 5.0, 150 mM NaCl, 0.5 mM ZnS0 4 ) was mixed in a 1:1 ratio 
with reservoir solution and equilibrated against 70 ul reservoir 
solution at 293 K. Orthorhombic rod-shaped crystals of 
approximately 30 x 10 x 10 um in size were observed after 3 d 
using a reservoir consisting of 0.2 M Li 2 S0 4 , 25% PEG 3350, 
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Figure 1 

Investigating the quaternary structure of BcCCP. A gel-filtration chromatogram for BcCCP is 
displayed on the left. The protein eluted with an apparent molecular mass of 140 kDa as calculated 
from a previously determined calibration curve. Native PAGE analysis of purified BcCCP is shown 
on the right. The 4-16% native PAGE (Novex) gel shows protein (lane 1) and molecular-mass 
standards (lane 2; labelled in kDa). 



0.1 M bis-tris pH 5.5. A single crystal was transferred into a 
cryosolution consisting of the original reservoir supplemented 
with 20% glycerol prior to flash-cooling at 100 K and was then 
characterized in-house using a MicroMax-007 rotating-anode 
generator and an R-AXIS IV ++ dual image-plate detector 
(Rigaku) prior to storage in liquid nitrogen. X-ray diffraction 
data extending to 1.9 A resolution were subsequently 
collected on the ID23-2 microfocus beamline at the European 
Synchrotron Radiation Facility (ESRF). Owing to the size of 
the crystal and its particularly long c axis, it was necessary to 
utilize a microfocus beam of approximately 10 um and a 
helical data-collection strategy. Such a data-collection strategy, 
combined with a short exposure time of 0.6 s and a MAR 
Research Mosaic 225 charge-coupled device detector, allowed 
good separation of the reflections, appropriate sensitivity and 
low X-ray damage, allowing a complete data set to be 
collected. Integration and scaling of the data were carried out 
using MOSFLM (Leslie, 2006) and SCALA (Evans, 2006). 
The space group was P2 1 2 1 2 1 , with unit-cell parameters a = 62.9, 
b = 85.9, c = 289.0 A. The asymmetric unit consists of four 
subunits each of mass 45.3 kDa, with approximately 1500 
residues in total, a V M value of 2.17 A 3 Da -1 and a solvent 
content of approximately 45%. 

2.3. Structure solution and refinement 

The structure was solved by molecular replacement using a 
polyalanine model of the monomer of a CCP structure from 
B. mallei (BmCCP), which shares 85% sequence identity 
(PDB entry 3k2k; Joint Center for Structural Genomics, 
unpublished work). The rotation and translation functions in 
Phaser (McCoy et al, 2007) revealed the positions of four 
polypeptide chains with a log-likelihood 
gain of 6469. Inspection of the mole- 
cular packing in Coot (Emsley et al., 
2010) showed no steric clashes and 
suggested that a tetramer, consistent 
with the SEC-MALS results (discussed 
below), was present. Rigid-body refine- 
ment was carried out in REFMAC5 
(Murshudov et al., 2011). The resolution 
range was set from 144.5 to 2.5 A and 
the R free value was 33.3% following 
rigid-body refinement. Ten cycles of 
restrained refinement were then carried 
out with the resolution extended to 
1.9 A. The i? WO rk and R free values were 
24.6 and 28.6%, respectively. Chain A 
was inspected with electron-density and 
difference density maps in Coot and 
some missing side chains were added, 
while those out of density were deleted. 
Chain A was rotated and translated to 
provide the other three chains in the 
asymmetric unit and the four Zn 2+ ions 
were included. The model was subjected 
to several cycles of PDB_REDO 
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Table 1 

Crystallographic statistics. 

Values in parentheses are for the highest resolution shell. 



Space group 


P2 1 2 1 2 1 


Unit-cell parameters (A) 


a = 62.90, b = 85.95, c = 289.02 


Resolution (A) 


40-1.90 (2.00-1.90) 


No. of reflections recorded 


516512 (74589) 


Unique reflections 


124234 (17924) 


Completeness (%) 


99.9 (100.0) 


Multiplicity 


4.2 (4.2) 




8.4 (2.7) 


Wilson B (A 2 ) 


13.3 


Radiation source and beamline 


ID23-2, ESRF 


Wavelength (A) 


0.873 


Residues 




Subunit A 


2-156, 159-383 


Subunit B 


2-318, 320-384 


Subunit C 


2-156, 159-314, 320-384 


Subunit D 


1-315, 320-384 


No. of waters 


1062 


No. of glycerol molecules 


9 


No. of ethylene glycol molecules 


7 


No. of PEG molecules 


1 


No. of Cl — ions 


3 


No. of Zn 2+ ions 


4 


No. of acetate ions 


4 


-"merge i V 


13.0 (48.4) 


lv workT' iv treeo \ /u / 


16.4/20.5 


Mean B factors (A 2 ) 




Subunits A, B, C, D 


6.7 


Ligands and waters 


26.2 


Cruickshank DPI^f (A) 


0.13 


Ramachandran plottt 




Most favoured (%) 


96.9 


Additional allowed (%) 


2.9 


Outliers (%) 


0.2 


Deviations from ideal values^ 




R.m.s.d., bond lengths (A) 


0.01 


R.m.s.Z, bond lengths 


0.56 


R.m.s.d., bond angles (°) 


1.14 


R.m.s.Z, bond angles 


0.62 



t ^ merg e = Em/ Ei \h(hkl) ~ ( 7 (^0>I/Em, T.ih(hkl\ where Ilhkl) is the intensity of 
the ith measurement of reflection hkl and (I(hkl)) is the mean value of hQikl) for all i 
measurements. % R wolk = J^hki \ \ F obs I - locale I |/Emz l^obsl. where F obs is the observed 
structure factor and F calc is the calculated structure factor. § R fTee is the same as ^ work 
except calculated with a subset (5%) of data that were excluded from refinement 
calculations. Diffraction-component precision index (Cruickshank, 1999). ff Chen 
et al. (2010). tt Engh & Huber (1991). 

refinement (Joosten et al., 2012) followed by electron-density 
and difference density map inspection, rebuilding and inclu- 
sion of water molecules and ligands. Translation/libration/ 
screw analysis (TLS; Painter & Merritt, 2006) was applied to 
create different group descriptions. 

The PDB_REDO process consisted of the following steps, 
(i) The B factor was reset to the Wilson B followed by ten 
cycles of TLS refinement using one TLS group per chain, (ii) 
Seven ten-cycle trial refinements were performed keeping the 
TLS model established in (i) fixed, using automatic geometric 
restraint weighting from REFMAC5 and trying increasingly 
loose ^-factor restraint weights. The optimal weight was 0.30, 
which is lower than the default, (ii) Six 30-cycle refinements 
were performed using the fixed TLS model, the optimized 
5-factor restraint weight and increasingly tight geometric 
restraint weights. A restraint weight of 0.10 gave the best 
refinement results. Details of the selection algorithm are 
presented in Joosten et al. (2012). (iv) Model rebuilding 



consisting of water deletion, peptide flipping, side-chain 
rotamer searching and His/Asn/Gln flipping was performed to 
improve hydrogen bonding (Joosten et al., 2011). (v) A final 
refinement consisting of five cycles of TLS refinement starting 
from the previously refined model, followed by ten cycles of 
restrained refinement using a ^-factor weight of 0.30 and 
trying three different geometric restraint weights (0.05, 0.10 
and 0.15) was performed. Refinement with a slightly looser 
geometric restraint weight than before (0.15) gave the best 
results. 

A further round of map inspections, model fitting and 
PDB_REDO refinement was carried out. The output model 
was again inspected with the relevant maps and adjusted, 
further waters were added and multiple conformers were now 
included at several positions. Further restrained refinement 
followed, using the ^-factor and geometric weights established 
with PDB_REDO and a TLS description comprising 12 
groups in total. Local noncrystallographic symmetry (NCS) 
restraints were imposed at the onset of the refinement and 
were maintained until the final round of calculations. The 
crystallographic statistics and model geometry are shown in 
Table 1. Coordinates and structure factors have been depos- 
ited in the PDB with accession code 4b6z. The model 
geometry was monitored with MolProbity (Chen et al, 2010). 
Structure superpositions were calculated using Secondary 
Structure Matching (Krissinel & Henrick, 2004) in Coot (using 
LSQKAB; Kabsch, 1976), and the Protein Interfaces, Surfaces 
and Assemblies (PISA) server (Krissinel & Henrick, 2007) 
was used to characterize the protein-protein oligomerization 
interface. Amino-acid sequence alignments were performed 
with Clustal Omega (Sievers et al., 2011). Active-site volumes 
were calculated using ICM PocketFinder (MolSoft Limited). 
Secondary structure was assigned using Coot and by inspec- 
tion. Figures were prepared using PyMOL (DeLano, 2002) 
and ALINE (Bond & Schiittelkopf, 2009). 



3. Results and discussion 

3.1. General comments 

Size-exclusion chromatography was used as the final puri- 
fication step and the protein eluted as a single species with a 
mass of approximately 140 kDa; a native PAGE gel showed 
a band of around 146 kDa (Fig. 1). The theoretical mass of 
BcCCP is approximately 45.3 kDa; these data were therefore 
taken to infer the presence of an oligomer, but it was unclear 
what the quaternary structure was. However, SEC-MALS 
shows that the protein elutes with a mass of 174 kDa (Fig. 2) 
corresponding to a tetramer, and this is consistent with the 
assembly observed in the crystal structures of BcCCP (the 
subunits are labelled A-D; Figs. 3 and 4), BmCCP and the 
protein from Shewanella denitrificans (SdCCP; PDB entry 
312n; Joint Center for Structural Genomics, unpublished 
work). This is distinct from PaCCP, which was reported to 
behave as a monomer in solution (Otero et al., 2012). The 
tetramer has a gap or hole at the centre of the assembly and 
this may have influenced the elution properties, whereas the 



282 Rimsa et a/. • M14-type cytosolic carboxypeptidase 



Acta Cryst. (2014). D70, 279-289 



research papers 



use of light scattering offers a better approximation of the 
assembly size. 

Diffraction data were acquired by exploiting the use of a 
microfocus synchrotron-radiation beamline to accommodate 
the small size of the crystal and to help circumvent the issue 
of a long unit-cell edge. The structure was solved by molecular 
replacement and, in order to carry out the efficient refinement 
of a relatively large asymmetric unit with over 1500 residues 
and data to high resolution, we adopted a slightly unconven- 
tional route by using PDB_REDO. This procedure uses a 
variety of automated protocols to test different models for B 
factors using weighting restraints on geometry, optimization 
with respect to diffraction data terms and side-chain 
rebuilding, for example (Joosten et al, 2012). The approach 
has been developed to efficiently apply modern refinement 
techniques to update and improve the contents of the PDB. 
It has been our experience that the majority of structures that 
we have inspected have been improved in terms of agreement 
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Figure 2 

SEC-MALS analysis of BcCCP. The horizontal blue line corresponds to 
the calculated mass of BcCCP, which was determined to be 174 kDa. The 
theoretical mass of a monomer is 45.3 kDa; thus SEC-MALS indicates 
the presence of a tetramer. 



Subunit A 



Subunit 5 
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Figure 3 

The tetramer of .BcCCP. The protein is displayed as a semi-transparent 
van der Waals surface over the ribbon cartoon. Subunits are labelled A 
(yellow), B (green), C (cyan) and D (blue). Asterisks mark the positions 
of disordered loops. 



with the diffraction data and in many cases in chemical 
correctness. The approach worked well in this refinement and 
we suggest that the lessons encapsulated in PDB_REDO have 
much to offer especially if incorporated early in the refine- 
ment process. We suggest that near the end of the refinement 
and building process, when interactive rebuilding is limited to 
the final polishing of the model, it is no longer advantageous to 
perform subsequent PDB_REDO runs. Instead, the optimized 
refinement settings from the last PDB_REDO run can be used 
in REFMAC5 directly. To this end, PDB_REDO has been 
updated to provide a keyword file for REFMAC5 that can be 



Subunit A 




Subunit A 



Figure 4 

The interfaces between subunits A and B (a) and subunits A and C (b) in 
the tetramer. The secondary-structure elements which contain residues 
contributing to tetramer formation are labelled. The N- and C-termini are 
labelled and the disordered loops are marked with asterisks. The dashed 
line points to the N-terminal end, which is located at the back and is not 
visible in the cartoon. 
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used in the CCP A graphical user interface (Winn et al, 2011) to 
quickly import these settings. 

Continuous well defined electron density is observed for 
most of the structure. The only disordered regions include the 
N- and C-terminal residues, residues 157-158 in chains A and 
C and a number of residues in a loop (315-320) in subunits B, 
C and D. Superposition of C a positions of all four chains gives 
root-mean-squared deviations (r.m.s.d.s) in the range 0.23- 
0.35 A, indicating that the subunits are, within the errors 
associated with the structure, essentially identical. Note that 
the diffraction-component precision indicator (Cruickshank, 
1999) was calculated to be 0.13 A. We therefore detail only 
subunit A unless stated otherwise. 

A strong feature in electron-density and difference density 
maps was observed in each active site near Arg226. These 
were modelled as acetates derived from the 50 mM sodium 
acetate present in the crystallization conditions. In the crystal 
structure of PaCCP complexed with guanidinoethylmercapto- 
succinic acid (Otero et al, 2012; PDB entry 4a39) and in the 
structure of CP-A with benzylsuccinic acid (Christianson & 
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Figure 5 

(a) The secondary and tertiary structure of .BcCCP. A ribbon diagram of subunit A showing 
a-helices coloured magenta and /3-strands coloured orange, with Zn 2+ depicted as a grey sphere. The 
N- and C-terminal positions are labelled, as are the a-helices and /3-strands. A small section of a 
loop (residues 157 and 158, marked with black asterisks), as well as the first and last residues at the 
N- and C-termini, are absent from the model owing to disorder, (b) The primary and secondary 
structure of BcCCP. Residues that are strictly or highly conserved in BcCCP, BmCCP, SdCCP, 
PaCCP and CeCCPP-6 are shown on black and grey backgrounds, respectively. Three residues that 
coordinate Zn 2+ are marked with yellow stars. The three motifs of the N-terminal domain which are 
conserved in CCP members are enclosed by green boxes. 



Lipscomb, 1986; Mangani et al, 1992; PDB code lcbx) the 
carboxyl groups of the ligands are located at the same position 
as this acetate (data not shown). 

3.2. Subunit structure and comparison with other CCPs 

The structure of BcCCP consists of two domains, a common 
feature of M14 family members. The primary, secondary and 
tertiary structures are presented in Fig. 5. Residues Metl- 
Glull2 form the N-terminal domain, while the remaining 272 
residues make up the CP domain. The N-terminal domain 
folds into a nine-stranded antiparallel 0-sandwich. Strands 02, 
03, 05, 08 and 07 form one sheet and strands 01, 04, 09 and 06 
form the other. This domain is specific to CCP proteins and is 
absent in other carboxypeptidases. It has been hypothesized 
that the N-terminal domain might contribute to folding, might 
have a regulatory function and/or might be involved in binding 
other proteins (Kalinina et al. , 2007; Rodriguez de la Vega et 
al., 2007). A comparison of the structure of the N-terminal 
domains of BcCCP and PaCCP gave an r.m.s.d of 0.87 A over 
109 C a positions, indicating close simi- 
larity. 

The CP domain of BcCCP displays a 
similar topology of secondary-structure 
elements, an a/ft/a sandwich structure 
with a seven-stranded antiparallel 0- 
sheet, as observed in other M14 family 
carboxypeptidases, for example bovine 
carboxypeptidase A (CP-A; Fig. 6). A 
superposition of BcCCP and bovine 
CP-A (Kim & Lipscomb, 1990; PDB 
entry 6cpa), which share about 20% 
sequence identity, gives an r.m.s.d of 
2.03 A for 225 C a positions. The cores of 
the structures fit well (Fig. 6), but on the 
periphery of the active site there are 
large differences that will be described 
later. Overlay of the CP domain of 
BcCCP with those of BmCCP (PDB 
code 3k2k), SdCCP (PDB entry 312n) 
and PaCCP (PDB entry 4a39; Otero et 
al, 2012) reveals a higher degree of 
conservation, with r.m.s.d values of 
0.45 A (374 C a atoms), 0.85 A (362 C a 
atoms) and 1.07 A (365 C a atoms), 
respectively. These proteins share about 
85, 50 and 40% sequence identity with 
BcCCP, respectively, and the degree of 
structural similarity correlates well with 
the level of sequence conservation. 

BcCCP contains several motifs which 
are conserved in CCP family proteins. 
One of them, Asn216-Pro217-Asp218- 
Gly219, occurs between 013 and a4, 
with the proline at the start of the helix 
(Fig. 5b). This motif contributes to 
domain folding and the formation of 
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part of the active site (Fig. 7). On one side of the motif Asp218 
establishes hydrogen bonds to the side chains of Thrl40 and 
Argl44 and to the main-chain amide of Glul42. These inter- 
actions serve to link /313 to /310 and which are on the 
surface of the domain. On the other side of the motif 
Asn216 O 51 accepts a hydrogen bond from the amide of 
Gly219 to hold the side chain in place and N 52 is a donor in 
hydrogen-bond formation to the main-chain carbonyl of 
Glnl70 in /312, thus linking different elements of secondary 
structure together. The side chain of Glnl70 also participates 
in four hydrogen bonds that link /312 to /314 and the segment 
between a4 and a5. The /3-strand link (not shown) involves the 



donation of a hydrogen bond from Glnl70 N e2 to Asp266 O . 
The other three hydrogen bonds, to the carbonyl of Leu234 
and the side chains of Asn235 and Trp238, help to position /312 
and the interhelical segment. The positioning of /312 and /314 
contributes to the formation of the active site because they 
provide Hisl71 and His268, respectively, to coordinate the 
catalytic Zn 2+ . 

Mammalian CCP proteins are considerably larger than their 
bacterial homologues, with N- and C-terminal extensions plus 
numerous insertions. The sequences of the extended regions 
vary and their function is at present not known. However, all 
CCP members share three short sequence motifs within the 

N-terminal domain which are 
absent in other CP families 
(Kalinina et al, 2007). These 
conserved motifs are located in 
close proximity, even though they 
are distant in the primary amino- 
acid sequence (Figs. 5 b and 8). 
Motif I, F(D,E)xGx(L,I), residues 
9-14, links /31 with /32 and extends 
to form part of the second strand. 
Motif II, W(F,Y)(Y,N)(F,Y), resi- 
dues 40-43, forms a major part of 
/34, a buried strand, and with the 
sequence Trp-Phe-Tyr-Tyr contri- 
butes significantly to the forma- 
tion of the hydrophobic core of 
the domain together with other 
aromatic residues such as Phe9, 
Phe57, Trp69, Trp83, TyrllO and 
Phelll (data not shown). Resi- 
dues 113-115, P(F,Y)(S,T), form 
motif III at the N-terminus of al 
on the surface of the domain. This 
segment forms part of the inter- 
face between the N-terminal and 
CP domains and contributes to 
interactions with the partner 
subunits in the dimers (A-B and 
C-D) forming a tetramer. It is 
noteworthy that these motifs and 
features are conserved in the 
PaCCP, BmCCP and SdCCP 
structures (Fig. 5b), and this 
observation supports the idea that 
together they are important for 
the correct folding of the domain 
and its position with respect to the 
CP domain. 

3.3. Quaternary structure 

BcCCP forms a tetramer 
arranged as a dimer of dimers 
(A-B and C-D in Fig. 3). The 
N-terminus of each BcCCP 




Figure 6 

Stereoview C a trace showing the superposition of .BcCCP (black) with bovine CP- A (red; PDB entry 6cpa). 
The a6-al loop of BcCCP is drawn in cyan. The active-site Zn 2+ is shown as a grey sphere and a loop that 
remains disordered in BcCCP is marked with two asterisks. 




Figure 7 

The conserved Asn-Pro-Asp-Gly motif. The left panel shows the residues in stick representation and 
selected hydrogen-bonding interactions as dashed black lines. The Asn-Pro-Asp-Gly residues are shown in 
cyan. The right panel shows a ribbon diagram of a subunit to indicate the position of this motif with respect 
to the overall fold. 
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subunit is located close to the CP domain of the dimer partner 
(Fig. 4a). This might explain why the N-terminal hexa-His tag 
could not be cleaved with TEV protease. The average solvent- 
accessible surface area of the four subunits in the asymmetric 
unit is approximately 15 500 A 2 . The total solvent-accessible 
area of the BcCCP tetramer is 52 600 A 2 , with a buried area of 
9400 A 2 . The interface formed between two BcCCP dimers 
(A-B and C-D) occludes 2800 A 2 , while the interface between 
subunits A-C and B-D equates to 1900 A 2 . Therefore, about 
9% of the surface area of each subunit is involved in dimer 
formation, with a further 6% involved in associations to 
assemble a tetramer. The free energy of dissociation is esti- 
mated to be 14.9 kcal mol -1 , suggesting that the tetrameric 
assembly is thermodynamically stable. These values correlate 
well with those calculated for tetrameric BmCCP and SdCCP 
proteins. For example, the total surface areas of the BmCCP 
and SdCCP structures are 52 500 and 49 900 A 2 , with buried 
areas of 11 900 and 9600 A 2 , respectively. The secondary- 
structure elements forming the interface within a BcCCP 
tetramer are shown in Figs. 4(a) and 4(b). Individual BcCCP 
dimers are stabilized by around 24 hydrogen bonds, nine salt 
bridges and van der Waals interactions. Some of the residues 
contributing to dimer formation include Glnl27 and Glull7 in 
al, Aspl04 in a loop between £8 and 09, Serl05 in 09, Arg80 
in a loop between 06 and 07, Glul42 in a loop between 010 
and 011, Vall34 in 010 and Asn8 in 01. We note eight 
hydrogen bonds and two salt bridges between a pair of dimers 
in a tetramer. A few of the residues involved in these inter- 
actions include Aspl93 in a loop between al and a3, Lysl86 in 




Figure 8 

Three motifs common to the CCP family: motif I (marine), motif II 
(orange) and motif III (purple). The active-site Zn 2+ is shown as a grey 
sphere and the three conserved motifs of the N-terminal domain that are 
found in all M14D family members are shown in marine and are labelled 
accordingly. 



al, Arg201 in a3 and Pro354 in a loop between 016 and aS. 
These residues are not conserved in PaCCP, which is a 
monomer in solution. However, these residues are well 
conserved in BmCCP and SdCCP, in which similar subunit- 
subunit associations are noted in tetramer formation (data not 
shown). 

3.4. The active site and specificity 

The active site of BcCCP is an elongated cavity with a base 
formed by residues in 014, 015 and 016, walls formed by loops 
linking a4-a5, a6-al, 03-04 and 05-06 and then the 014-015 
turn (Figs. 5 and 9). Here, the catalytic Zn 2+ is coordinated by 
two histidines (Hisl71 and His268), Glul74 in a chelating rf 
interaction and a water molecule to give a pentacoordinated 
distorted tetrahedral geometry (Fig. 10). The active-site cleft 
is occupied by an acetate derived from the crystallization 
conditions. This anion, which marks the binding site for the 
substrate carboxylate, is held in place by electrostatic inter- 
actions with Arg226, Asn235 and Arg236 (Fig. 10). 

The bacterial CCP active sites, and in particular the metal- 
ion environment (Fig. 11), display a high degree of structural 
conservation and this even extends to bovine CP-A, although 
here the overall sequence identity is only about 20% (see 
§3.2). The Hisl71 and Glul74 residues occur in a His-Pro-Gly- 
Glu motif which is highly conserved in CCP proteins (Fig. 5b). 
In BcCCP and BmCCP a buried Argl69 side chain is posi- 
tioned by a salt bridge to Glul78 and a hydrogen bond to the 
carbonyl of Tyrll4, and in turn helps to stabilize the confor- 
mation of the His-Pro-Gly-Glu motif by making a hydrogen 
bond to the main-chain carbonyl of Prol72 (data not shown). 
In the other known structures different residues occupy the 
space near the motif and fulfil the role of stabilizing part of the 
metal ion-binding site. In bovine CP-A Phell6 provides steric 
bulk and Gln76 provides the hydrogen bond to the active-site 




Figure 9 

The active-site cavity of .BcCCP. The protein is depicted as a semi- 
transparent van der Waals surface over the ribbon cartoon. The surface is 
coloured according to atom type: C, white; O, red; N, blue. Zn 2+ is shown 
as a grey sphere. Key contributors to the formation of the active site and 
the SI' pocket are labelled. 
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Figure 10 

The active site and Zn 2+ coordination in BcCCP. Selected residues in the 
active site and acetate are displayed and coloured as follows: C, white 
(yellow for acetate); O, red; N, blue. The 2F Q — F c electron-density map is 
shown for acetate, where F 0 represents the observed structure factors and 
F c represents the calculated structure factors. The map is depicted at the 
lcr level (cyan chicken wire). The cation and a water ligand are shown as 
spheres coloured grey and marine, respectively. Zinc coordination is 
shown by black continuous lines; blue dashed lines represent hydrogen- 
bonding interactions. 




Y315 \ 
Figure 11 

A comparison of the BcCCP and bovine CP-A active sites. The 
coordinates of CP-A were those of PDB entry 6cpa and the figure is 
based on the superposition shown in Fig. 2. Residue side chains are 
depicted as sticks with atomic positions coloured red for O, blue for N and 
green for C for bovine CP-A or all in cyan for .BcCCP. The C* atom of 
Gly253 of CP-A is shown as a small black sphere. The catalytic cations are 
shown as overlapping grey spheres. 



proline to position this metal-binding motif (Christianson & 
Lipscomb, 1989). 

A proposed mechanism for bovine CP-A involves the 
activation of a water molecule by the divalent cation assisted 
by Glu270 acting as a general base and interacting directly 
with substrate. This supports the generation of a potent 
nucleophile (Christianson & Lipscomb, 1989). Argl27 helps to 
stabilize an oxyanion hole by binding to the substrate carbonyl 
(Kim & Lipscomb, 1990). These key residues are conserved 
as Arg226 and Glu344 in BcCCP (Fig. 11). Other residues 
conserved in the active site throughout the M14 family include 
Asnl44 and Argl45 of bovine CP-A, which bind to the 
C-terminal carboxylate of the substrate. In BcCCP they are 
conserved as Asn235 and Arg236. 

Whilst most of the CCP polypeptide chains overlay well, as 
described, there is a noteworthy exception, part of the flexible 
a6-al loop along one side of the active site, which displays 
different conformations (Fig. 6). Similar observations about 
flexibility and different configurations of this part of the active 
site have been made when comparing bovine and rat 
carboxypeptidase structures (Faming et al., 1991). 

The inherent flexibility of this loop is indicated by the 
thermal parameters. In BcCCP the thermal parameters for the 
C a atoms in this section, residues His313-Tyr320, average at 
27.4 A 2 compared with an overall average of 6.7 A 2 for all 
protein atoms. The structural difference is best exemplified by 
comparing the positions of Lys319 and Tyr320 of the two most 
conserved structures, BcCCP and BmCCP (Fig. 12). The 
BcCCP structure displays an open configuration and the 
BmCCP structure a closed configuration with Tyr320 directed 
into the active site. The C a positions of this pair of residues 
differ by just over 7 A. 




Figure 12 

The flexible a6-al loop. The BcCCP and BmCCP (PDB entry 3k2k) 
subunit structures are overlaid and the C a trace for residues 315-326 is 
shown in green and blue, respectively. The side chains of Lys319 and 
Tyr320 are shown as sticks. 
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An overlay of BmCCP (PDB entry 3k2k) and CP- A (PDB 
entry lcbx), both displaying the closed configuration, places 
the hydroxyl groups of Tyr320 and Tyr248 within 1 A of each 
other (data not shown). In BcCCP Tyr320 is directed out and 
away from the active site and a major conformational change 
of about 17 A would be required to place the hydroxyl in the 
active site in a similar position to that of Tyr248 in CP-A. In 
CP-A the tyrosine adopts an outward configuration in the 
absence of ligand but swings in when a ligand, for example 
glycyl-tyrosine, is present. Site-directed mutagenesis suggests 
that it is not a critical determinant of the carboxypeptidase 
reactivity since k cat is unaffected for small substrates. 
However, a significant increase in K m indicates a role in ligand 
binding, so this tyrosine and the pliable loop in which it resides 
may assist in the recognition and binding of protein substrates 
(Gardell et al, 1985). Subsequent mutagenesis studies led to 
the proposal that the tyrosine hydroxyl contributes to the 
mechanism by helping to activate the catalytic Zn 2+ -bound 
water (Cho et al, 2001). We cannot find any structural data to 
support such a conclusion since the functional group on the 
tyrosine side chain is too distant to make a hydrogen bond to 
the catalytic water that is bound to Zn 2+ and there is already a 
glutamate contributing such a function. Rather, we agree with 
Gardell etal. (1985) that this tyrosine is well positioned to bind 
substrates and to assist in fulfilling the hydrogen-bonding 
capacity of substrates, intermediates or products during the 
catalytic cycle. There is another tyrosine, Tyr315, positioned 
in the active site a little closer to the catalytic Zn 2+ and the 
associated water molecule. This then provides a tyrosine pair 
that might contribute to substrate placement in BcCCP. 

The substrate specificities of bovine CP-A and rat CP-A2 
are well understood (Faming et al, 1991) and structural 
differences between CP-A and BcCCP suggest that these 
enzymes have distinctive specificities. Firstly, it is worth noting 
that we are comparing a ligand complex of CP-A with unli- 
ganded BcCCP This is relevant in that the a6-al loop displays 
a closed configuration in CP-A, positioned over the active site. 
The residues in these loops placed in the vicinity of the active 
site might interact with substrates, but the key determinants of 
specificity reside in the side-chain binding pocket, the SI' site 
itself (Figs. 9, 10 and 11), and are strongly influenced by only 
a few residues. In CP-A these are Ile243, Asnl44, Ala250, 
Gly253, Ile255, Asp256 and Thr268. Ile243 and Ala250, 
together with Tyr248, are in the a6-al loop capping the SI' 
pocket. Asnl44 and Thr268 are conserved as Asn235 and 
Thr342 in BcCCP part-way down to the bottom of the SI' site. 
Ile255 in CP-A corresponds to Ser329 in BcCCP and the 
hydroxyl group makes the bottom of the Sl r subsite polar. 
Nearby, two differences, Gly253 and Asp256 in CP-A but 
Leu327 and Lys330 in BcCCP, make the the SI' subsite much 
more shallow in the bacterial enzyme and replace an acidic 
residue with a basic one (Fig. 10). In bovine CP-A Asp256 
directly interacts with a side chain of substrate (Christianson 
& Lipscomb, 1986) and by providing an acidic hydrogen-bond 
accepting capability helps to define the specificity of the 
enzyme. The side chain of Lys330 in BcCCP extends into the 
space occupied by Ile243 and Asp256 in CP-A (data not 



shown). Lys330 of BcCCP is strictly conserved in BmCCP and 
therefore the SI' subsite in the Burkholderia enzymes is 
perfectly matched to bind glutamic acid, consistent with the 
putative function as a polyglutamylase. The lysine is replaced 
by an asparagine (Asn321) in both SdCCP and PaCCP, so the 
SI' subsite would still be cabable of binding glutamate. 

To test BcCCP for deglutamylase activity, in vitro assays 
were performed using a furylacryloyl glutamate derivative 
(data not shown). Only low levels of this substrate were 
cleaved, indicating that BcCCP might not be a deglutamylase, 
that the tested 'substrate' is not appropriate or that the 
enzyme might require activation, as is the case for bovine 
CPA. 

3.5. Issues regarding druggability 

The concept of a druggable protein concerns the properties 
of a binding site on the target to bind molecules that possess 
the right physicochemical attributes to make them bioavail- 
able and have a high affinity for the target and a low toxicity 
to humans (Hunter, 2009; Fauman et al, 2011). The optimal 
target would be a small, well defined and ordered cavity in the 
protein with pronounced hydrophobic components, although 
there are notable exceptions. The active sites of enzymes such 
as pteridine reductase (PDB entry 2x9g; Dawson et al, 2010) 
and dihydrofolate reductase (PDB entry 3cl9; Schormann et 
al, 2008) would be described as 'druggable'. The volumes of 
their active sites are estimated as 350 and 380 A 3 , respectively. 
Based on the crystal structures of the proteins discussed here 
the substrate-binding cavity of bacterial CCP is estimated to 
occupy a volume of approximately 310-410 A 3 , which would 
be judged of a sufficient size to bind drug-like molecules (An 
et al, 2005). 

Although size matters, the chemical properties of the active 
site are also critical. In this respect, we note specific functional 
groups (e.g Lys330) and hydrophobic surfaces (tyrosines) that 
might interact with potential inhibitors. There is, in addition, 
the presence of the catalytic Zn 2+ that provides opportunities 
to incorporate metal-ion affinity into inhibitory molecules. 

4. Concluding remarks 

The structure of BcCCP has been determined. Only small 
crystals with a relatively long unit-cell c parameter of about 
290 A could be obtained. However, by exploiting the advan- 
tages of a microfocus synchrotron beamline we were able to 
obtain high-resolution diffraction data. An unconventional 
approach to refinement was adopted by exploiting the 
strengths inherent in the automated protocols of 
PDB_REDO. This ensured efficient refinement of an asym- 
metric unit containing over 1500 residues and a large number 
of water molecules. Our experience was positive and points to 
a future in which such an approach might become standard. 
The structure is compared with those of other CCP proteins 
and with that of CP-A. The CP domain of BcCCP is similar to 
other M14 family CPs, while the N-terminal domain assembles 
into a fold unique to CCP subfamily proteins. Our data 
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provide details of the active site and reveal the spatial 
conformation of residues involved in Zn 2+ coordination and 
reaction catalysis. Specific differences localized to the SI/ 
pocket on one side of the active site suggest that the bacterial 
CCP proteins process substrates based on the recognition of 
an acidic residue. This would be commensurate with their 
assignment as polyglutamylases, although further experiments 
are required to address this issue. 

This research was supported by awards from the Biotech- 
nology and Biological Sciences Research Council, The Well- 
come Trust (grant Nos. 082596, 094090 and 100476) and the 
European Commission Seventh Framework Programme 
(grant agreement 223461, Aeropath) to WNH. RPJ is 
supported by Veni grant 722.011.011 from the Netherlands 
Organization for Scientific Research. We thank the European 
Synchrotron Radiation Facility, Grenoble for excellent 
support. 



References 

An, X, Totrov, M. & Abagyan, R. (2005). Mol. Cell. Proteomics, 4, 
752-761. 

Bond, C. S. & Schiittelkopf, A. W. (2009). Acta Cryst. D65, 510-512. 
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, 

R. M., Kapral, G. X, Murray, L. W., Richardson, X S. & Richardson, 

D. C. (2010). Acta Cryst. D66, 12-21. 
Cho, X H., Kim, D. H., Lee, K. X, Kim, D.-H. & Choi, K. Y. (2001). 

Biochemistry, 40, 10197-10203. 
Christianson, D. W. & Lipscomb, W. N. (1986). Proc. Natl Acad. Sci. 

USA, 83, 7568-7572. 
Christianson, D. W. & Lipscomb, W. N. (1989). Acc. Chem. Res. 22, 

62-69. 

Cruickshank, D. W. X (1999). Acta Cryst. D55, 583-601. 

Dawson, A., Tulloch, L. B., Barrack, K. L. & Hunter, W. N. (2010). 

Acta Cryst. D66, 1334-1340. 
DeLano, W. L. (2002). PyMOL. http://www.pymol.org. 
Eadsforth, T. C, Gardiner, M, Maluf, F. V., McElroy, S., James, D., 

Frearson, X, Gray, D. & Hunter, W. N. (2012). PLoS One, 7, e35973. 
Emsley, P., Lohkamp, B., Scott, W. G & Cowtan, K. (2010). Acta 

Cryst. D66, 486-501. 
Engh, R. A. & Huber, R. (1991). Acta Cryst. A47, 392-400. 
Evans, P. (2006). Acta Cryst. D62, 72-82. 

Faming, Z., Kobe, B., Stewart, C B., Rutter, W. X & Goldsmith, E. X 

(1991). /. Biol Chem. 266, 24606-24612. 
Fauman, E. B., Rai, B. K. & Huang, E. S. (2011). Curr. Opin. Chem. 

Biol 15, 463-468. 



Gardell, S. X, Craik, C S., Hilvert, D., Urdea, M. S. & Rutter, W. X 

(1985). Nature (London), 317, 551-555. 
Gasteiger, E., Hoogland, C, Gattiker, A., Duvaud, S., Wilkins, M. R., 

Appel, R. D. & Bairoch, A. (2005). The Proteomics Protocols 

Handbook, edited by X M. Walker, pp. 571-607. Totowa: Humana 

Press. 

Gomis-Ruth, F X. (2008). Crit. Rev. Biochem. Mol. Biol. 43, 319-345. 

Hunter, W. N. (2009). /. Biol Chem. 284, 11749-11753. 

Joosten, R. P, Joosten, K., Cohen, S. X., Vriend, G & Perrakis, A. 

(2011). Bioinformatics, 27, 3392-3398. 
Joosten, R. P, Joosten, K., Murshudov, G N. & Perrakis, A. (2012). 

Acta Cryst. D68, 484-496. 
Kabsch, W. (1976). Acta Cryst. A32, 922-923. 

Kalinina, E., Biswas, R., Berezniuk, L, Hermoso, A., Aviles, F X. & 

Fricker, L. D. (2007). FASEB J. 21, 836-850. 
Kim, H. & Lipscomb, W. N. (1990). Biochemistry, 29, 5546-5555. 
Kimura, Y., Kurabe, R, Ikegami, K., Tsutsumi, K., Konishi, Y., 

Kaplan, O. L, Kunitomo, H., lino, Y, Blacque, O. E. & Setou, M. 

(2010). /. Biol Chem. 285, 22936-22941. 
Krissinel, E. & Henrick, K. (2004). Acta Cryst. D60, 2256-2268. 
Krissinel, E. & Henrick, K. (2007). /. Mol. Biol. 372, 774-797. 
Leslie, A. G W. (2006). Acta Cryst. D62, 48-57. 
Mangani, S., Carloni, P & Orioli, P (1992). /. Mol. Biol. 223, 573-578. 
McCoy, A. X, Grosse-Kunstleve, R. W., Adams, P D., Winn, M. D., 

Storoni, L. C & Read, R. X (2007). /. Appl. Cryst. 40, 658-674. 
Moynie, L., Schnell, R., McMahon, S. A., Sandalova, T., Boulkerou, 

W. A., Schmidberger, X W., Alphey, M., Cukier, C, Duthie, F, 

Kopec, X, Liu, H., Jacewicz, A., Hunter, W. N., Naismith, X H. & 

Schneider, G (2013). Acta Cryst. F69, 25-34. 
Murshudov, G N., Skubak, P, Lebedev, A. A., Pannu, N. S., Steiner, 

R. A., Nicholls, R. A., Winn, M. D., Long, F & Vagin, A. A. (2011). 

Acta Cryst. D67, 355-367. 
Otero, A., Rodriguez de la Vega, M., Tanco, S., Lorenzo, X, Aviles, 

F X. & Reverter, D. (2012). FASEB J. 26, 3754-3764. 
Painter, X & Merritt, E. A. (2006). /. Appl. Cryst. 39, 109-111. 
Rawlings, N. D., Barrett, A. X & Bateman, A. (2012). Nucleic Acids 

Res. 40, D343-D350. 
Rodriguez de la Vega, M., Sevilla, R. G, Hermoso, A., Lorenzo, X, 

Tanco, S., Diez, A., Fricker, L. D., Bautista, X M. & Aviles, F X. 

(2007). FASEB J. 21, 851-865. 
Rogowski, K. et al. (2010). Cell, 143, 564-578. 

Sahab, Z. X, Hall, M. D., Sung, Y M., Dakshanamurthy, S., Ji, Y, 
Kumar, D. & Byers, S. W (2010). Cancer Res. 71, 1219-1228. 

Schormann, N., Senkovich, O., Walker, K., Wright, D. L., Anderson, 
A. C, Rosowsky, A., Ananthan, S., Shinkre, B., Velu, S. & 
Chattopadhyay, D. (2008). Proteins, 73, 889-901. 

Sievers, F, Wilm, A., Dineen, D. G, Gibson, T. X, Karplus, K., Li, W, 
Lopez, R., McWilliam, H., Remmert, M., Soding, X, Thompson, 
X D. & Higgins, D. G (2011). Mol. Syst. Biol. 7, 539. 

Winn, M. D. et al. (2011). Acta Cryst. D67, 235-242. 

Winstanley, C et al. (2009). Genome Res. 19, 12-23. 



Acta Cryst. (2014). D70, 279-289 



Rimsa et a/. • M14-type cytosolic carboxypeptidase 289 



